Trace Driven Analytic Modeling for Evaluating Schedulers for Clusters of Computers
Abstract
Large enterprises use clusters of computers with varying computing power to process workloads that are heterogeneous in terms of the type of jobs and the nature of their arrival processes. The scheduling of jobs from a workload has a significant impact on their execution times. This report presents a trace-driven analytic model (TDAM) method that can be used to assess the impact of different schedulers on job execution times. The TDAM approach uses an implementation of the scheduler to schedule jobs that are fed into analytic models of the computers in the cluster. These analytic models use closed queuing network methods to estimate congestion at the various nodes of the cluster. The report demonstrates the usefulness of the TDAM method by showing how four different types of schedulers affect the execution times of jobs derived from well-known benchmarks. The report also demonstrates how the method can be applied to heterogeneous computer clusters such as the ones used to run MapReduce jobs.
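The closed queuing network step mentioned in the abstract can be illustrated with exact Mean Value Analysis (MVA), a standard technique for such networks. This is only a minimal sketch under stated assumptions, not the report's implementation: the abstract does not say which closed queuing network method TDAM uses, and the function name `mva`, the single job class, and the service demand values are all hypothetical choices made for illustration.

```python
from typing import List, Tuple


def mva(demands: List[float], n_jobs: int,
        think_time: float = 0.0) -> Tuple[float, List[float], List[float]]:
    """Exact single-class MVA for a closed network of queueing stations.

    demands[k] is the total service demand (seconds per job) at station k.
    Returns (throughput, residence time per station, mean queue length per station).
    Assumption: one job class and load-independent stations; illustrative only.
    """
    queue = [0.0] * len(demands)       # mean queue lengths with 0 jobs in the network
    residence = [0.0] * len(demands)
    throughput = 0.0
    for n in range(1, n_jobs + 1):
        # An arriving job sees the queue of the network with one job fewer.
        residence = [d * (1.0 + q) for d, q in zip(demands, queue)]
        throughput = n / (think_time + sum(residence))
        # Little's law applied to each station.
        queue = [throughput * r for r in residence]
    return throughput, residence, queue


if __name__ == "__main__":
    # Hypothetical node with a CPU and a disk (demands in seconds per job),
    # holding 4 jobs placed there by some scheduler.
    x, r, q = mva([0.05, 0.08], n_jobs=4)
    print(f"throughput={x:.2f} jobs/s, response time={sum(r):.3f} s")
```

In a trace-driven setting, one could call such a routine per node each time the scheduler changes the set of jobs on that node, and use the resulting residence times as the congestion estimate.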
Similar Resources
MODELING FILE-SYSTEM INPUT TRACES VIA A TWO-LEVEL ARRIVAL PROCESS
A method for analyzing, modeling and simulating a two-level arrival-counting process is presented. This method is particularly appropriate when the number of independent processes is large. The initial motivation for this method was the need to analyze and represent computer file system trace data that involves activity on some 8,000 files. The method is also applicable to network trace data ch...
ACL 2 for Parallel Systems Software: A Progress Report
A significant development in high-performance computing has occurred in recent years with the proliferation of “Beowulf” clusters [6]. Beowulf clusters are parallel computers assembled from commodity-priced personal computers and networks. The explosive growth of the personal computer marketplace, together with rapid technological advances in the hardware sold there, has driven the price/perfor...
Failure Analysis and Modeling in Large Multi-site Infrastructures
Every large multi-site infrastructure such as Grids and Clouds must implement fault-tolerance mechanisms and smart schedulers to enable continuous operation even when resource failures occur. Evaluating the efficiency of such mechanisms and schedulers requires representative failure models that are able to capture realistic properties of real world failure data. This paper shows that failures i...
ANP Application in Evaluating Ecological Capability of Range Management (Case Study: Badreh Region, Ilam Province)
Rangelands are important for plant productivity, livestock production, wildlife, and the conservation of soil and water resources. One of the main problems of rangelands is that they have not been used according to their potential, which leads to further degradation. The purpose of this study was to evaluate the range management capability of the Badreh region in Ilam province, Iran, using ANP (Analytic Ne...
The Importance of Feedback in Evaluating and Designing Parallel Systems Schedulers
An important goal of any parallel-system scheduler is to promote the productivity of its users. To achieve high productivity the scheduler has to keep its users satisfied and motivate them to submit more jobs. Due to the high costs involved in deploying a new scheduler, it is uncommon to experiment with new designs in reality for the first time. Instead, whenever a new scheduler is proposed, it...
Publication date: 2014